New Results in Linear Filtering and 
Prediction Theory' 

A nonlinear differential equation of the Riccali type is derived for the covariance 
matrix of the optimal filtering error. The solution of this "variance equation" com- 
pletely specifies the optimal filter for either finite or infinite smoothing intervals and 
stationary or nonstationary statistics. 

ualt i i > ' i e Ifami in anonical) differentia 

zqua e calculus of variat < 1 n some case 

e variance eq » is Hit ed by exa hi dup it 

simplify, or extend earlier results in this field. 

j , i i u i imahan and deiermtm *t > ' ' 

problems plays an important role in t < > rehcal results. In several examples, 
the estt , mat a iss ie-hy-sidt 

Properties of the variance equation art of great interest in the theory of adaptive 
systems. Some aspects of this are considered briefly. 


1 Introduction 

At phessbht, n nonspecialist might welt regard the 
, , s ii r henry ring a rediction [ J, 2] 2 as 

'classical' - in short, a field where the techniques are well 
established ami only minor improvements and generalizations 
eaa be expected. 
That this is not really so can be seen convincingly from recent 

Its of i t (;?], Meeg [-1 " t 1 > r and l 
i y i \ t ) i these in rs 1 

ved Mini iii iii ik' n 

prediction theory. Wo present here a unified account of our own 

(dependent . hes during I < vo years i i ' ,v< i 
with much of the work (3-7] just mentioned), as well as numerous 
new results. We, too, use Uuu;-domain methods, and obtain 
najor improvements and generalizations of the conventional 

tii tn particul i • , 

modification to multivariate problems. 
The following is the historical background of this paper. 
InanesUnM m of the sta t'ianer filtering pro Poll 

IS; obtaim i i lalionshspa between time varying gains an.* error 
variances for a given circuit configuration. Later, Hanson (91 
proved U I t ■configuration was a ptimal 

fot Uu as urie e stntistb a nn.it.,- er, hf showed that the difff w n- 
■ equations U I -i.it (hi tbtniw y Feilin 
r . in > " in ft . quatioi "i - il 

were then generalized by Buc; [10] who found explicit rela- 
tionships between the optima! weighting functions and the error 
variances; he also gave a rigorous derivation of the variance 
equations and those of the optimal filter for a wide class of non- 
t i -r - l ii, . i lis* statist 
Independently of the work juat mentioned, Kfttaan [11] gave 
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a new approach to the standard filtering and prediction problem. 
The novelty consisted in combining two well-known ideas: 

(i) the "state-transition" method of describing dynamical sys- 
tems [12-1.4], and 

j i t t 1 i \ t t s »i i i > r » i >j t n ' 1 « 

apace [15, pp. 150-155]. 

As an important by-product, this approach yielded the Duality 

„ i Mil bieh provid i link between (etacha 
filtering theon i tern ii is) ) t tro <>« • Beca af of 
the duality, results on the optimal design of linear cord/o! systems 
[13, 16, 17] are directly applicable to the Wiener problem :«0)u&i- 
ity plays an important role in this paper also. 

Vhs it ' n t each other's wort was 

sex i i :» that the pi i ,p d conclusion of both investigations 
was >i i ii (Terence in i Is 

Rather than to attack the Wimer-Hopf integral equation directly, 
ii is better to convert ' ii '« a nonlinear differential equation,, whose 
solution yield* ins covariance matrix of the minimum filtering error, 
u hi • i turn co tains nil necessary information for the design of the 
optimal fdter, 

2 Summary of Results: Description 

The problem considered in this paper is stated precisely in 
Section 4. There are two main assumptions: 

(Ai) A sufficiently accurate model of the message process is 
given by a linear (possibly time-varying) dynamical system 
excited by white noise. 

V, nil, i m addttiv 'urn 

Assumption (As) is unnecessary when the random processes in 
question are sampled (discrete-time parameter); see [11]. Even 

n the cotit i s-tinx s 1 < « r a) * i since it can 
be removed in various ways aa will be shown in a future paper. 

issumi owever, is quite has i naiogoi o but 

somewhat less restrictive than the assumption of rational spectra 
in the conventional theory. 

| I < - million he best 1 late of 

he message based ig in either a finite or infinite 

time*interval. 

| < , » nsist of 0\ 

equations: ', 
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(I) The differential equation governing the optimal filter, 
which is excited by the observed agnate and generates the beat 
linear estimate of the message. 

(II) The differential equations governing the error of the beat 
linear estimate. 

(HI h tii i t pi ' iu i h i .» . tei spressea n 
terms of the error variances, 

I Tl lit ! * 11 8 ,h 

variance matrix of the errors of the beat linear estimate, called the 
variance- equation. 

(V) The formula for prediction. 
The solution of the variance equation for a given finite tiroe- 
I i „ ! lution of the estima ir pi 

iction problem with resj < me time-inten 0 

steady-state solution of the variance equation corresponds to 
finding the best estimate based on all the data in the past. 
As a special case, one gets the solution of the classical (station- 
Wiener problem riding i ' 

the variance equation. This requires solving a set of algebraie 
equations and constitutes a new method of designing Wiener 
filters. The superior effectiveness of this procedure over present 
methods is shown in the examples. 

Some of the preceding ideas ate implicit already in [10, 1X1; 
they appear here in ft fully developed form. Other more ad- 
vanced problems have been investigated only very recently and 
provide incentives for much further research. We discuss the 
following further results: 

(1) The variance equations are of the Riccati type which occur 
» < ale i - ) i i > i r to the can 

differential equations of Hamilton. This relationship gives rise 
to a well-known analytic formula for the solution of the Riccati 
equation [17, 18]. The Hnmilloma! equations havt also been 
used recently {191 in the study of optimal control systems. The 
two types of problems are actually duals of one another as men- 
tioned in the Introduction. The duality is illustrated by several 
examples. 

(2) A sufficient condition for the existence of steady-state solu- 
tions of the variance equation (i.e., the fact that the error variance 
docs not incrr i is that, t fo n matrix it 
t„ ris fRJ Fiiher [21!] be nonsingular l\ conditio s 
c sidembl eaker than the usual waif - meesag 
process have finite variance. 

(3) A sufficient condition for the optimal filter to be stable is 
the dual of the preceding condition. 

The preceding results are established with the aid of the "state- 
transition" method of analysis of dynamical systems. This con- 
sists essentially of the systematic • - 

which results in simple and clear statements of tiie main results 
independently of the complexity of specific problems. This is 
the reason why multivariabie filtering problems can be. treated by 
our methods without any additional theoretical complications. 
The outline of contents is as follows: 

In Section 3 we review the description of dynamical systems 
nt of vi 5 4-5 contain precise statf 

] Iter r tt it'- i i . , r i 

t„ m i Secti 1 te the filtering pi in an 
dual t" conventionnt block-diagram terminology. Section 7 con- 
tains a precise statement of ail mathematical results. A reader 
nterestx ii tiw > ' 1 

to the work - mil igorous deriva 

i of t lamented equal - ' ion 8. S 

u .(,,. :6 proof based o ' I) li I'rindpl. >' tlx « > st >< 
, 1 ,i , t ,i ... ition. Tht 

, analvdc wdut n? f tl» irknc i »twn Is discussed in 
Section K In Section 1 examine brie! let »tion o < 
results to adaptive filtering problems. A critical evaluation of 
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the current status of the statistical filtering problem is presented 
in Section 13. 


3 Preliminaries 

In the main, we shall follow the notation conventions (though 
not the specific nomenclature) of [11], {161, and [21J. Thus r, I, h 

refer U tl th te, <f3 , *„ xt, ■ ■ -, <*», & > *>>'< " a ' e 

■ real . « 1 xrs; a b x, y, . • , «, *, ■ • • »™ peters, A, B 

IV... are matrices. The prime dei rathe tra 
matrix; thus x'y is the scalar {inner) product and xy' denotes 
a matrix with elements i,y, (outer product). \\x\\ - (x'*> 1 » 
the euclidean norm and |N!«A (where A is a nonnegative definite 
n itr % is the quadratic form with respect to A. The eigenvafe 
of a matrix A are written as X,(A). The expected value (as- 
semble average) is denoted byS (usually not followed by bracket*). 
The eovarianee matrix of two vector-valued random variables 
x«), y(f) is denoted by 

Sx«)y'(r) - 8x(t)8y f (r) or covtx(l), y(r)) 

depending on what form is more convenient. 

Real-valued linear functions of a vector x will be denoted . 
x*; the value of x* at x is denoted by 


where the *H are the co-ordinates of x. As is well known, x* • 
be regarded abstractly as an element of the dual vector tpae* of liKi 
x's; for this reaso n, x * is called a eoeector and its co-ordinates sh» 
the'zV In algebraic manipulations we regard x* formally 
row vector (remembering, of course, that x* * *')■ Thus 
inner product is x*y" and we define |jx*jj by (x*x* ) Air r 
S[»», xP - S(x*x)' - Sx'xx'x" 

= x*(6xx'5x- = |**|W 

To establish the terminology, we now review the eseentfci'i 
the eo-called state-transition method of analysis of dynaso ri 
aystems. For more details see, for instance, 1211. 

A Unear dynamical system governed by an ordinary dBcw.i_4 
equation can always be described in such a way that the o., 
ing equations are in the standard form: 

dn/dt - F(l)x + CKiMO 03 

where x is an n-voctor, cal I " '' ' ' " ' "'*'' 1 ' 

<■ i. « state mriabU ufi) is an i r ! d 1 

function; F(t)and<3(t)aren X nam > 
whose element* an continuous function* of «>' '•>',, « 

The description (I) is incomplete without specify*"* the vuM- 
ptd y(0 of the system; this may be taken as a p-vector whoae 
components are linear combinations of the state variables : 

y(t) = H(f)x(0 & 

wi • i'( H '. is a | > •» »tri> " ntimi atsin i 

The matrices F, G, H can be usually determined by in»i • 
if the system equations are given in block diagram form. .> 
,, . an (« in - ctioi t iwldl* emerobc d thai . 

. a ri ps may be i i gutar I present he dynaf 
the constraints on affecting th ite of v ml inpu 
H the constrai on observi he stai f tl •".» 

For sine input aje-output ' G and H coosH 
of a single column and single row, respectively 

If F, G, H art on taut ' - If «(0 * u 

, , . 3^0 is saiii to ( ret 
H W well known S.21-2 ba« i <*•» turn of (l)n»>" 

be written in the form 

Transactions of the ASSiE 


unit matrix (5) 
mediate by the existence and 

«I»-'(f„ e.) - 4»(fc, (,) far ail U, h (65 
fc) - f»)*(«t, <t) for all fc, c,, k (7) 

n be represented by 


k) - rxp F(t - 


t < i i i computations. I 

.rjA>ciai ease, one can also express * analytically in terms of the 
. tenvalues of f, using either linear algebra 1221 or standard 
•rsnifer-i unction techniques (14|. 

!- i t is i . t • I right-hand s 

•3) fey a notation that focuses attention on how the state of the 
*' >"< • ' > %i wee as a lion of tin < Thus 
ms write the left-hand «dc of Ci) m 


u) 


cov fu«}, «{r)i - Q(l)-6(t - r) 

cov {»«), v(r)J - R(t)-fi(t - r) for ail t, r (12) 
cov Mt), v(t)1 - 0 
where 5 is the Dtrac delta function, and Q(t), R(t) are symmetric, 

nnegative denude matrices < tinuoti iifferentiai in i 
We introduce already here a restrictive assumption, which is 

ti for the «s hi g the n ticai tev fopments: 
(A,) The matrix R(t) is positive definite for all t. Physically, 

this means that no component of the signal can be measured 

exactly. 

random process x(i) uniquely, it is 


m - fl m *(*. T)G{r) U (T>iT 


to add a further assumption. This may be done in two different 

(A,) The dynamical Bystem (10) has reached "steady-state" 
under the action of u(<), in other words, x(t) is the random func- 
tion defined by 

(13) 

This formula is valid if the system (10) is uniformly asymp- 
totically stable (for precise definition, valid also in the noncon- 
stant case, sea [21]), If, in addition, it is true that F, G, Q are 
constant, then x(i) is a stationary random process — this is one of 
the chief assumptions of the original Wiener theory. 

However, the requirement of asymptotic stability id incon- 
• , eat hi . j*.. < j :. Foi ' ince, iti no Rati I 1 i Example 
5, which is a useful model in some missile guidance problems. 
Moreover, the representation of random functions as generated 
by a linear dynamical system is already an appreciable restriction 
and one should try to avoid making any further assumptions. 
Hence we prefer to use: 

(A,') Tine measurement of x(t) starts at some fixed instant U 
of time (which may be - «> }, at which time eovlxifc), x{U)\ is 
known. 

Assumption ( Ai) is obviously a special case of (t^t')- Moreover, 
since (10) is not necessarily stable, this way of proceeding makes 

1 to e i i I the. message variant 

grows indefinitely, which w excluded in the conventional theory. 
The main object of the paper is to study the 
OPTIMAL ESTIMATION PROBLEM. Owen known values 
of z(r) in the lime-interval k 5 r £ t, find an estimate x(h\t) of 
x(h) of the farm 

(9) k«,|£) - V A(tt, rWr)Jr (14). 


«ad rhesiat ,>i fh at m *t lme i, evolving from the 
;,ual state x = x«„} at time U, under the action of & fixed forcing 
faction a(i). For simplicity, we refer to <f» as the motion of the 
- namical system 

Statement of Problem 

W* shall be concerned with the continuous-time analog of 
Voblwn I of reference [11], which should be consulted for the 
;>hyseai motivation of the assumptions stated below. 

(Ai) The mtssuye is a random process x(t) generated by the 

dx/di - F(i)x + G(t)ki) (10) 
oi - •• i signal s 

m =• y(0 + v«) - H{i)*(t) + v«) 

"Om functions u{i), v(<) in (10-11) are independent random proe- 
«osm {white noise) with identically zero means and covariance 
'Matrices 


(where A is an n X p matrix whose elements are eonttn 
> 1 entit to with the property that the ext 

squared error in estimating any linear fimclian of tlte message i 
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811*0.) - gulf* 
(6) In view of (Ai), it is clear tliat Sxfi,) - 6x«,|i) = 0. 
Hence [**, i(h\t)] is the minimum , variance linear unbiased 
estimate of the value of any cosiale x* at x(U). 

(c) If 8u(«) is unknown, we have a more difficult problem which 
will be considered in a future paper, 

(d) It may be recalled (see, e.g., {11 J) that if a and v are 
gaussian, then so are also x and %, and therefore the beat estimate 
will be of the type (14). Moreover, the same estimate will be best 
not only for the loss function (15) but also for a wide variety of 
other loss functions, 

(e) The representation of white noise in the form (12) is not 
rigorous, because of the use of delta "functions." But since the 
delta function occurs only in integrals, the difficulty is easily re- 
moved as we shall show in a future paper addressed to mathema- 
ticians. VII lerm "j ieal de\ pi - j < e pa] 
are rigorous. 

i i - t i i 1 i i r assumptions 

(As), (A a ), (A,') is stated in Section 7 and proved in Section 8. 

S The Dual Problem 

It will be useful to consider now the dual of the optimal estima- 
tion problem which turns out to be the optimal regulator problem 
in the theory of control. 
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(16) 


FW = F'(0 
G*{<*) = H'(0 
H*(C) = G'(i) . 

Let <*»*((*, U*) be the transition matrix of the dual dynamical 
system of (1): 

d**/df - f(f)x* + G*((*)u*(t*) (17) 
It is easy to verify the fundamental relation 

to*) - *'{<* i) (18) 
With these notation conventions, we can now state the 
OPTIMAL REGULATOR PROBLEM. Consider the linear 
i r wtia " ' i (if l „ * 

»*(;♦) - k'u*(t*), t**) (is) 


"-/fill*** 


' li «l> *(««*; x, t 


This 


i i ;li-k:iowi 1 t 

the optimization of a regulator with integrated-squared-error 
t; 1 1 pe ,' -• i oirc index, 

i S ryofth regulator probl 

I 1 " ir il [li it i , ~ tit* CB > 

i m , bh'iiu problem because of the 

>l fTVTin:ORl The wlutims cf ike eptitKuI estima- 
tion problem and of the optimal regulator problem are equivalent 

tfoj J| I ' fftrffOTM (16). 

I li f ( i ,, s * , , l 

Ihrt i r , . 

i ' ion problem i i,i h * , 

'" ' ']''< • th. « tti iitj U irem is eorreet of tht - >h - 

' 1 ' 1 ' t 1 ' c i tor pro . . i words, the 0} 
'i icontroUawk* roust be a li ir fm on of x* 

The first proof of the duality theorem appeared in (11], and 
consisted of comparing the end mailt* of the solutions of the two 
problems. Assuming only that the solutions of both problems 
result in linear dynamical systems, the proof becomes much 
simpii i i , • •» - ■ . , , . t « , ,, ,1 „ ,* , 

detail in (16). 

Ren an i <■< K 

the extent of rt I iiijt i« first it - ! in (20) by 

l!y*(r*) - y,*(r*f ck,.) 
where y/«*) ?rf 0 is the desired trnlpid (in other words, a* the 
regulator problem is roplnrai by a «:rvonieciianism or follow-up 
- ! ' ' ' > iftvo I . • , . maiion problem wit] 

fiu(() 0. 

6 Examples: Problem Statement 


for.-n.ili general J 

e present here twm ap • > ic pr >l>leri « in 


imnolog 


1 Of thus. 


problem; is given in S 

JixampU I. Let tb« model of the, message process be a first- 
1 b. iuic m it i» not assumed 

'I i is the simplest problem 

i '» i , i . . J , a re t by Wiener iiii 

|1, pp. 91-92). 



fig, t Example 


The model of the message process is shown to Fig. 1(a). 't ,.g 
various matrices involved are aJl defined by 1 X 1 and are 

F(0 - Lf»), G«) - HI, H(f) - |i], 

QCO - h»\, R(0 = Ir,,}. 


dx*,/dt* - /„x»» + »*,(»•>, itVO = i*.(0 
and the performance index is 

J]^* {?i.Ix',(t»))« + m[tt*dV)!«|(ir* . « 

j1 e ersion of t mat! roblem was tr > 

in 111, Example 1). The dnal problem was treated by Rozoisr S 
[MI. 

Example $, The message is generated as in Example 1, * 
now it is assumed that, two separate Fieisal? (mixed with • .- 
ferent noise) can be observed. Hence R is now a 2 X 2 m, • -. 
and we assume that 

-m 

The block diagram of the model is shown in Fig. 2(a). 
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if i , i - t I b put tint: white > 

through the tr, t 1 k diagram of 

the model is shown in Fig. 3(a). The system matrices are: 

'-[?-.] -[i] ""»" 

!, h dt !, ' he ordei . > id 

interchanged 1 i pci rema 

same as (21). The dual problem was investigated by KipinirJi 

[241. 

Transactions o( the AS: 1 


Fig. 3 Example 3s Block 



diagram of duol problem 


.rated by putting white note 
- fufn). The block diagnti 


■a "A 


Hi i i . i i on the tv > ex i] 

"starting" assumptions and in the i>h 
Example S. Following Sbinbrol " 


given by 
The m 
is defined 


ion el . ' ' 1 j ' 

i draw iiig tin isj i 

id state variable* are interchanged, see Fig. 6. Evi- 

j - ami r\ - . i : , t > nc itui . , i . 


issage model for the 


. two examples is L) itj st 


■ B lie in the nature of the 
ed signals. 

i\: consider the following 
. time to - 0 with a fixed 
but. unknown velocity of zero mean and known variance. The 

additive white noif 
tion and velocity. 

The verbal description ot the pi lei in t th.it />u{0) = 
PsiifO) = 0, pai(0) > 0 and q a = 0. Moreover, G => 0, H = 
(501. See Fig. 7(a). 

}t isml of tS rohlem boi hat imt I t calls for 
minimizing the performance index 


We are U 



•)lWr' (f <0) 


. the 


i 


words: We ore given a transfer tan 
the time-interval [< *, 0] should be selected in such a way as 
to minimize the sum of (i) the square of the velocity and (ii) the 
control, energy. In the discrete-time case, this problem was 
treated in [11, Example 2J. 

Example 8. We assume here that the transfer function l/t* is 
excited by white noise and that both the position as, and velocity 
xt can be observed in the presence of noise. Therefore (see Fig. 


fig. 6 Example 4: Block diagram of dual problem 



Fig. 7 Examples: Block 


procetj and optimal filter 
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us pro! i - > in < Hti ■'!,<> i > 

rhe dual problem is i nilw to Kx« 5 

7 Summary of Results: Mathematics 

Herew esent ttm i r > * < '• ! i.athe- 

U t Sttl iHiandmgof the 

problem, the rig iroo te compf 

requiring meed a i > I < - they arc to be 

found in Sections 8-10. After reading this section, one may pass 
without loss of continuity to Section 11 which contains the solu- 
tions of the examples, 

(1) Canonical form of the optimal filter. The optimal estimate 
£(£ji) is generated by a linear dynamical system of the form 

di{t\i)/dl = T{t)i(t\i) + K<0i(«|<) 

«(i|0 - z{i) - H«i(«|l) 

Iheini i < l is zero 

i or o| th ial t ipol! tio m a< J U« i la i n 

- *(fc. 08<*|0 (fi S 0 . (V) 
No similarly simple formula is known at present for interpolation 
(li < <)• 

ill < t of (I) a » i .,,,,< The 

variables appearing in this diagram are vectors and the "boxes" 
> represen t i i tUng <w ' 1 s ir 

r nutalivity of i ion) such gcnerali 

block diagrams are subject to the same rules as ordinary block 
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diagrams Tin ' i > - . ica k <ib . < f sig 1 flow scr 
as a reminder that we. are dealing with multiple rather than 
single signals. 

'-piil. r ! a 1 ii it is obtained 

king a copy of thi lei of the m< ag< , p si tting i 
it i • I, forming th igna! i((\t) and f< 

ing the error forward with a gain K(t). Thus the specification of 
the optimal hlter is equivalent 1<i the computation oi the optimal 
time-varying gains K(t). This result is general and does not de- 
pend on constancy of the model. 

(2) Canonical form for (he dynamkal mplem gamming the 
optimal error. Let 

- x«) - x(t|<) (22) 


the 


whic 


error, 5(t|i) ia governed by the * 
di(t\i)/dt = F«>sc*|0 + G(i)ufi) - K(<)M<) 

+ HU)i(t!t» (H) 

See Pig. 10. 

(3) Optimal gain. Let us introduce Hie abbreviation; 

P(0 = covfx(ilt), i(M)l (23) 
Then it can bu shown that 

K(0 - P(i)H'Ci)R-'(0 • <ni) 

(4) Farfance equation. The only remaining unknown is P<<). 
It can be shown that P(t) must be a solution of the matrix dif- 
ferential equation 

dP/dt - F(i)P + PF'(i) - PH'«JR-'(OH«)P 

+ G(0Q«)G'{() (IV) ; 

Transactions of the «M£ 


This fa the variance equation; it Is a system of n(» + l)/2* non- 
linear differential equations of the first order, and is of the RiccoM 
type well known in the calculus of variations (17, 18]. 

solutions of the variance equation. Given any 
and a nonnegative definite matrix Po, (IV) has 

solution 


(5) Existence 
fixed initial 


P«) - Xl(t; P 8 , ft) 


(34) 


defined for ail \i — h\ sufficiently small, which takes on the value 
P(fe) - Ps at i - fe. This follows at once from the fact that (IV) 
itisfies a Li| I 1 

Since (IV) is nonlinear, we cannot of course conclude without 
< t , g t tion P(i) exists for nil t (21 > 

taking into account the problem from which (IV) was derived, 
! ever, si i es »i P(f) I) is del for all i ' 

These results can be summarised by the following theorem, 
which is the analogue of Theorem 3 of [11] and is proved in 
Sections: • ^ 

FHEOREM ' <',>'■ ' imption • < \~ \, h (he 
solution of the optimal estimation problem with U > — » is given by 
relations (I-V). The solution P(<) of (IV) it uniquely determined 
for all t g It by the specification of 

Ps - cov[x(t.), x(fe)l; 
knowledge of P(() in turn determines the optimal gain K(t). The 
ntfiaistntt * <mai filler ts 0. 
<0) Variance of the estimate of a costate. From (23) we have 
i t i i la for (15): 

Six*, x(!it>!« - HxiVco (25) 

(7) Analytic solution of the variance equation. Because of the 
Hose relationship between the Pticeati equation rind the calculus 
of variations, a closed-form solution of sorts ia available for (IV). 
The easiest way of obtaining it is as follows [17] : 

Introduce the quadratic Hamillonian function 

3C(k,w,0= -O/aflGWrJftaw 

- w'F'( { )x + (V»)i|H(f)w|h-«» (26) 
and consider the associated canonical differential equations 

d*/dl - d3C/dw» - -F'(«)x + H'(<)R->(<)H(0w ] 

I (27) 
dw/dt - -d3C/dx ~ G(i)Q(0G'(0x + F«)w J 

We denote the transition matrix of (27) by 


P(0. 


This is the number of distinct elements of tho symmetric m 


In Section 10 we shall prove 

THEOREM 2. The solution of (IV) for arbitrary nonneaaivie 
definite, symmetric P* and all t&t»can be represented by the formula 
mt; Po, h) - [<5>«0, fc) + ««ft MW • t®u(i, k) 

+ ©.»(*, fo^.]- 1 (29) 

Unless all matrices occurring in (27) are constant, this result 
simply replaces one difficult problem by another of similar dif- 
ficulty, since only in the rarest cases can &(t, i») be expressed in 
analytic form. Something has been accomplished, however, since 
we have shown that the solution ofnonamstant estimation problems 
involves precisely the same analytic difficulties as the solution of linear 
differential equations with variable coefficients. 

(8) Existence of steady-stale solution. If the time-interval over 
which data are available is infinite, in other words, if k - - <*>, 
rem 1 ib i ion > 1 mo further restrict 

For instance, if H(t) aa 0, the variance of i is the same as the 
variance of x; if the model (10-11) is unstable, then x(0 defined 
by (13) does not exist and the estimation problem is meaningless. 

The following theorem, proved in Section 9, gives two sufficient 
conditions for the steady-state estimation problem to be meaning- 
ful. The first is tho one assumed at the very beginning in the 
convnntio lal Wit , • th >f> H <■ second c< nditio i, whi :h we n- 
troduce here for the first time, is much weaker and more "natural" 
than the first; moreover, it is almost a necessary condition as well . 

THEOREM 3. Denote the solutions of (IV) as in («)■ Then 
the limit 

Jim mt; Q,k) = PW (30) 

exists for all t md is a solution of (IV) if either 
(ki) the model (10-11) is ■uniformly asymploticaUy stable; or 
(A,') the model (10-11) is "completely observable" [171, thai is, 

for all I there is some Ut) < I ««* that the matrix 


»Th« notation o3G/dw rt 


M(fe, 0 - f* *'(T, <)H'(r)H(r)*(r, t)dr 

is positive definite. (See [21] for the definition of unifon 
stability.) 

Remarks, (g) P(£) is the covariance matrix of the optima! error 
corresponding to the very special situation in which (i) an arbi- 
trarily long record of past measurements is available, and (ii) the 
initial state x(fe) was known exactly. When all matrices in 
(10-12) are constant, then so is also P— this ia just the classical 
Wiener problem. Ia the constant case, P is an equilibrium 
state of (IV) (i.e., for this choice of P, the right-hand side of (IV) 
ia zero). In general, P(0 should be regarded as a moving equi- 
librium point of (IV), see Theorem 4 below. 

(h) The matrix M(&, I ) is well known in mathematical statistics. 
It is the information matrix in the sense of It. A. Fisher {20] 
corresponding to the special estimation problem when (i) u(0 ss 0 
and (ii) v(i) - gaussian with unit covariance matrix. In this 
case, the variance of any unbiased estimator fi{t) of [x,* x(i)] 
satisfies the well-known Cramer-Rao inequality [20] 
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etM(o - mm * s IMI'm -hi* » (32) 

<• Every costate x* has a minimum-variance unbiased estimator for 
which the equality sign holds in (S£) if and only if M it positive 
definite. This motivates the use of condition (A/) in Theorem 3 
and the term "completely observable." 

(«) It can be shown {17] that in the constant case complete 
observability is equivalent to the easily verified condition; 

ranklH', F'H', . . ., (F')»-'H'| - n (33) 

where the square brackets denote a matrix with n rows and np 
columns, 

(9) Stability of the optimal filter. It should be realized now that 
the optimoMy of the filter (I) does not at the same time guarantee 
its stability, The reader can easily check this by constructing an 
» . , i >r install > i \ 1 • s l sonsiste 
interacting «ytf* es t l sufficient lit s 

for stability entails some rattier delicate mathematical technicali- 
ties which we shall bypass and state only the best final result cur- 
rently available. 

First, some additional definitions. 

We say that the model (10-11) is uniformly completely ob- 
servable if there exist fixed constants, a s , a,, and <r such that 

a,j|x*j|» £ |i*i|*M(»-».<) S «4|x*li s for ail x* and f, 

milarl e m it a model m I reliable • , 
n pi rottabU] if U dual i lei is complete! 

eervable [uniformly completely observable]. For a discussion of 
i , s 1 i r-f cr to [ 1 t!i ( 1 1 

that toe property of "uniformity" is always true for constant 
systems. 

We can now state the central theorem of the paper : 

THEOREM 4- Assume that the model of the message process is 

{At") uniformly completely observable; 

(A») uniform I < ' 

(A,) or, £ Q(i)S| S a*, a, S |!R<0|1 S a. for all f; 

(A,5 |!F(i)H 5 a,. 

I'Ae» ifc« following is true: 

(i) 77i« optimal u <■ o fally stable; 

(ii) Every solution U(t; P,, fc) o/ (A« wianw df^wrn (IV) 
tarling , • > tit P • f> t 

{defined in Theorem $)ast-* <b. 

Remark*. ] A 'ill i which is not uniformly t> i) >ii >1 
stable w ' 1 1 it |21 

practical i limiUd 

(k) Proper t > in Tl >rcm 4 i >f centra! im] rt c - i. < ' 
shows that the variance equation is a 'Vat ' i >u> t i n! 
i i d i t i to round 


with P. Note, however, that the procedure may fail if the con- 
ditions of Theorems 3 and 4 are not satisfied. See Example 4. 

(11) Solution of the Dual Problem. For details, consult [17J.'; 
The only facte needed.here are the following; The optimal con- 
trol law* is given by 


134) 


1 1 wept ' > g e of Hi) to I can be estimated 
l <_'■'>.> ( 1 ). 

(.10) Solutumof the classical Winner problem. Theorems 3 and 4 

vitsft i mined wollary; 
Till OR! M is veins $ and 4 

■ at F, G, H, Q, R, are constants. 
Then, if k = - «>, the solution of the estimation problem is ob- 
taincd ii < i i - > and solving 

the resulting set of quadratic algebraic equations. That solution 
which is nonnegaliv* definite is equal to P. 

th rve thi y the assumption of eon- 

P(0 i stai < i i 4, s i of (iV) 

starling at unegativi „ • ■ <- converge to P J f ' e, if a 
matrix P is found for which the right-hand side of (IV) vanishes 
and if this matrix is nonnegative definite, it must be identical 


«♦(*«) - -•;*((')«(<'! 

where «*(**) satisfies the duality relation 

K*(<*> - K'(0 (35) 
and is to be determined by duality from formula (III?. The 
value of the performance index (20) may be written in the form 

min V(x*; <*, k*, «») - «*.«i> 

where ll*(t*; x*, U,*} is the solution of the dual of the variance 
equation (IV). 

It should be carefully noted that the hypotheses of Theorem 4 
are invariant under duality. Hence essentially the same theory 
covers both the estimation and the regular problem, as stated in 
Section 5. 

The vector-matrix block diagram for the optima! regulator is 
shown in Fig. II. 



(12) Computation of the covariance matrix for the message process. 
To apply Theorem 1, it is necessary to determine cov (x(fe), x(fe)] . 
This may be specified as part of the problem statement as in 
I , e er hand, one mist! 1 < 

. rca< 1st ysta e(A • ' 1 
(13) and (12) we have that 

S(i) - cov ix«), x(l)l = f'_ m *«, r)G(r)Q(r)G<(r)«!'U r)dr 
provided the model ( 10) is asymptotically stable. Differentiating 

11* 1 f > ' r 1 

fcrcntial eomatioti ior Sit) 

<G>ldt = F(()S + SF'(0 + GWGKOGV) C38; 
This formula is analogous to the well-known lemma of Lyapucov 
! . vaiuating th ntt gi i qu • ' < < ' '> ' ' T 
, r ij, am In case < • .stant =y??om, (36) reduces 

to a system of linear algebraic equations. 

3 Derivation of the Fundamental Equations 

p i j i t r " 

t al equat i Diffct tiat , 1 t *o Umc ami 

then using (10-11), we obtain in 
mental equations of our theory. 

Much cumbersome manipulation of integrals cs 
recognizing, as has been pointed out by Pugachev [27}, that the 
i ier-Hopf equation is a specia se ol j ' ometr 

principle: orthogonal projection. 

Consider an abstract Bpace 3C such that an inner prodnet (X, Y ) 
is defined between any two elements X, Y of ST. The norm i« 
defined by \\X\\ - (X, X)' fy . Ut be a subitpsee of SC. *e 


a very simple way the funda- 
n be avoided by 
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seek a vector U* in 'U \s Kt -h mi . mues ''X - f/|| with r :epe it 
to any V ia <tt. If such a minimizm? vector exists, it may he 
.... . 

UKTH ON'AL VUU.M 11 n MMA \ - i 
|l* - UiMaU l : m if anHii) only if 

(X - U„ U} = 0 for all £/ in 01 (37) 
(iii) Moreover, if there is another sector W satisfying (37), Am 

• (. ; it 

, j < < , 1 

- ^ fix - EMS'' -i- 2{x - i/., r;, - P) + 11*' - cMI* 

Since 11 is a linear space, it contains V - V,; hence if Condition 
t , • r.ji.i-l..'«a herefore S!X S 

— ; j !..,•' iii o i bviotts. 

' (ii) Suppose there" is a vector t/» such that (X - lh, U s ) - a 


- U, 


■■ j!x - {/4| s + 2. 


WIKS 1 ' ' ' ' 

{ » , * j* » < » ' 1 ' ' 

„, ,. < , . . »' f ' i ' 

n \ ,,-tioa A«„ r) snifs/fc «J<« r«iatton 

covfxfi,), »(<r)l - Jj' A-.£„ r) eov(.(r), *(ff)]dr - 0 (3S) 

or equim'.eiiUy. 

cov{S{M0,x(«)] - 0 (39) 

for ait fc> £ » < t- 
COKOLLARY. cov{i((,!i5, Ht,\t)\ = 0 (-50) 
Proo/. Let x* be a fixed costate and denote by ft the space of 
„" scalar random variable U\ x(h)} of zero mean ai 
variance. The iwvr product ia defined as (X, Y) = 8(x*, 

X !,(, x* > ,) ti 
variables of the type 


For the moment we assume for BimpHeity that f, - Differen- 
tiating (38) with respect to t, and interchanging 6/d< and fe, we 
get for all t, £ o< t, 

~ oov[k(0. x«r)l - RQ covtx(f), x(a)l 

+ GK) covIu(t:, stf»J (4D 

— f A((, t) covWt), *(<r)]dr 

m A £ A(«,r) coviy(T), v(«)W + £ A((, «r/R(<r) 
„ £ •— A{i, T) covWr), »(cr:lrfr 

+ A«, () cov Sy(0, V(^)l « 2 > 
The last term in (41) vanishes because of the independence of 
u(0of v(cr) and *{<r) when a < I. Further, 
eov[y«), y(<01 - H(0cov(x(0, *(tr)] - eov[y(l), v(<r)l (43) 
As before, the last term again vanishes. Combining (41-43), we 
get, bearing in mind also (SS), 


V - I**, «(f,)| 


MrWr 


f B v .'i, i tinuoufly difft 

.-t. .r. .tmul>, i\f vno; / , n.r t..e estimate [x*. x(2i|01. 
We now apply the orthogonal projection lemma and find that 
condition (37) takes the form 

(X - Oh V) - S W*. S(h|<M»*. U <«1 
= ** C ovlxO,jO,uO,)]x*' 

Interchanging integration and the expected value operation 
c i . «bie in view of the continuity assumptions made, under 
(A,). see f2SS), we get 


X _ v„ V) - «• jj] f 'cov|t( (l !0,t( ff )lB'«„ 


-Ms- 


J'[R0A(f.T)- | A(l,i 


ITiis must vw for all x fctfficiencj 

obvious. To prova the necessity, we take B(ti, <r) — cov[i(<ilt), 
x(<r)l. Then 8B' is nonnegative definite. By continuity, the 

egra!«i > * unless BD > < 

B(t J( <r) vanishes identicaily for ail (o g o- < £. The Corollary 
follows trivial! s itiplyin«(3u)oi the right by A U, <r) and 
integrating with respect to or. Q.B.D, 

JJraari-. (m) Equation (39) does not hold when <r = t In 
fact, cov rtfiji), x (01 - PM K (0 R CO. 
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- A(t, <)H(£)A(<, 


,4 


for all k S » < «■ This condition is certainly satisfied if the opti- 
mal operator Aft, r) is a solution of the differential equation 

F(t)A(«, t) - ~ - A(i, r) - A«, 0HC0A«, t) = 0 (45) 
for all values of the parameter r lying in the interval S r £ !• 

' • f !/ 2' , ' r r , ■ C'l 

, , i.. fut ' t B / T j 1 the br. ' r . . r! 
If A(«, r) Bflti Wiener-) fopf < t ' ' " ' ' 

given by (14) is an optimal estimate; and the same holds alBO for 


ay|o + f t l w, r)x(T>iT 


since by (45) A«, r) + B(t,r) also satisfies the Wiener-Hopf equa- 
tion, But by the lemma, the norm of the difference of two opti- 
mal estimates is zero. Hence 

x* J j]' Bit, r)covMr), x(t'W{1, r')drdr'\ x*' - 0 (46) 

for ail x*. By the assumptions of Section 4, y(r) and v(t) are 
M cktod md tl i S in 

cov[x(T), x(T')l - R(r)5(r - r') + coviy(r), y(r')] 
Substitutang this into the integral (48), the contribution of the 
econd the nghr i egat He tl < 

he first term is positive unless (45) hoi f ' 1 

definiteness of R(r)), which concludes the proof. 

respect Ir ' — ' • 


" f "m Aa TWrVir + A(t ' 


Using the abbreviation A(«, t) = K(<) as well as (45) and (14), 
n t ,t in ai nee the differ it al i ju Uim of U .pt m»l '«" r 

dim/dt - R<)8(*) + WOWD - H(0S«N)1 (1 > 
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Combining (10) and (1), we obtain the differential equation for 
the error of the optimal estimate; 

aW\t)/dt - [HD - K(0H(f)i*(»l0 + G«M0 - K('M0 («) 

To obtain an explicit expression for K(t), we observe first that 
(39) implies that following identity in the interval U X o < t: 

cov (x(0. Y<«01 ~ P A«, t) cov Mt), y(<r)]dr - A<«, o-)R(ff) 
Jfc (39') 

Since both sides of (39') are continuous functions of <r, it la clear 
that equality holds also for <r - t. Therefore 


• The only p 


atbe -proof of Theorem 1 


K«)R(0 - A(t, l)R(i) - cov[x((|(), y(01- 


v [x(t|0, x(t)]H'(0 


By (40), we have then 

- cov [*(i|0, x(tji)|H'(i) - P(0H'(t) 
Since R(() is assumed to be positive definite, it is invertibie and 
therefore 

KM - P(t)H'(0R-'«) 

We cars now .* T , . In i quation T v l*0, r) be the 

common transition matrix of (I) and (11). Then 
P(t) - U) 

- 6 j*J M'U r)(G(T)u(r) - KfrMr)!^ 

X l»'(<r)G'(«0 - »'(»)KW]'l"ft 

Using the fact that u(t ) and v(i) are uncorrelated white noise, the 
integral simplifies to 

- X« ^ T)lG(r)Q(T)G'(r) + K(i-)R(r)K'(T)lV(t, r)dr 
r itu i III), weobt 

easy calculations the variance equation 
dP/di - F(«P + PF'(t) - PH'(t)R~K«)H(t)P 

+ G(i)Q(i)O'(0 (IV) 

Alternately, we could write 

dP/fi = dcov H, i]/dt - cov \difdi, i) + cov [x, dx/di] 
and evaluate the right-hand side by means of (II). A typical 
covariance matrix to be computed is 
cov |x«|f), u(01 

- cov [f' t mt, r)IG(r)u(r) - KCrMr»4r, ««)] 

= ('A)G(J)CXt) 
the factor '/« following from properties of the ^-function. 

To complete the derivations, we note that, if tx > t, then by 
(3) 

x(t.) - *(»«» i)x(0 + /J T)»(r )<fr 

Since u{r) for j < r S is independent of x(r) in the interval 
(, S r ^ (, tt follows by (38) that the optimal estimator for the 
right-band side above is 0, Hence 

Ht.\t) - *(<., 08(«|O «> fc 0 (V) 


tennin the initial conditions for (IV) From (38) it is dear t 

Hence ■ T 1^.' 

P 8 -P(8o)=xovlx(l,H i C f »Nl 
- covWh), x(fc)l 
In case of the conventional Wiener theory (see (A,)), the last tec 
is evaluated by means of (36). 
This completes the proof of Theorem I. 

9 Outline of Proofs 

Using the duality relations (16), all proofs can be reduced 5 
those given for the regulator problem in 1171 . 

(1) The fact that solutions of «ie variance equation exist fc* 
alll is disproved to 117, Theorem (t using the fact thai 
.;„..,;,. •;•«:. must bo finite in any finite interval (fo, <). 

(2) Theorem 8 Is proved by showing that th « tea part 
lar estimate of finite l 

ed in i rhi-orem (»..« 
modification of this proof goes through also with aasumptvv 

( ^3) Theorem 4 is proved in (17; Theorems (6.8), (6.10), (7. £ ; 
Th« stability of the optima filter proved by noiwg U 
> error plays ti v tenet™ 

f the variance < t i« W > J> 1 

Lyapunov function for P. This Lyapunov function ino,. 
simplest case is discussed briefly at the end of Example 1. V, hh 
,! ,, , true alsf the nonconstant case, at pre** 
must impose the somewhat restrictive conditions (A, - A?}. 

111 Analytic Solution o! the Variance Equation 

Ut X(i), W(t) be the (unique) matrix solution pair for te 
which satisfy the initial conditions 

Xffe) = I, Wfi,) - P. 0 


Then we have She following identity . • 

W(<) = P«pt(f), < fe <« (' 

which is eafliiv verified by substituting (48) with (IV) into 

On thr- other hand, in » of (47-4 

the first set of equations (27) that X(i) is the transition u> 
of the differential equation 

dx/dt - -F'«)x + HV)R-WH«*Wx 
wMch is the adjoint of the differential equation (IV) of 
optimal filter. Since the inverse of a transition matrix «L- 
exists, we can write 

; P(0 - W(tjX-HO, <a«« <■ 

This formula may not be valid for t < t* for then P(f) may 
^Only trivial stops remain to complete the proof of Tbeorei. 

11 Examples: Solution 

Example 1. If o„ > 0 and r„ > b, it is easily verified ttei 
conditions of Theorems 3-4 are satisfied. After trivial , 
stitutions in (IH-IV) we obtain the expression for the op- 


The same conclusion does nof follow if h < t 

independence bedween x(rj and u(t). 
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© of lack of 


MO - Mi)/'* 
i equation 

dpu/<U - tfnPtt ~ P>»'/ru + flii 
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steady-state, let 


aiy problem (i.e., to = - «>» see (A,)): 

p«= [/,.+ V/u' + in/rujru (82) 
Since ^, and n» are noBiie8ative.it a clear Oiat only the positive The differential equation for fetb' 


5pu(f) - Pu(0 ~ 


« R u is permissible in front of the square root, 
f Substituting into (SO), we get the following expressions for the 

optimal gain 

kn = /« + V7..» + W'» < 53 > 

„ , f ,>, i tesin ! i itrix (i.e., reciprocal time 
constant) 


dSpu/di - 2/,i«p !t - (SpaWni 
We now introduce a Lyapunov function (21) for (56) 

V(*J>n> - 0Pn/P>tT 
The derivative of V along motion* of (51 ) is given by 


% - fa - £u = -Vjfu» + W'n < 5 « 
of the optimal filter. We see, in accordance with Theorem 4, that 
; the optimal filter is always stable, irrespective of the stability of 
j the message 
s optimal filter. 

It is easily checked that the formulas (52-S4) agree with the re- 
sults of the conventional Wiener theory [29] . 
Let us now compute the solution of the problem for a finite 
j interval (f. > -»). The Hamiltonian equations 


(27) it 


. - -2ip,,/r„ + 9u/P..W«Pu) (57) 

model. Fig. 1(6) shows tte configuration of the This shows clearly that the "equivalent ^^^"^ 
for the variance equation depends on two quantities, (i) Hie 
message-to-noise ratio Pn/ru at the input of the optimal filter, 
(ii) the ratio of excitation to estimation error qn/pu. 

Since the message model in this example is identical with its 
dual, it is clear that the preceding results apply without any modi- 
fication to the dual problem. In particular, the filter shown m 
Fig. 1(6) is the same as the optimal regulator for a plant with 
transfer function l/U - /«)■ The Hamiltonian equations (27) 
for the dual problem were derived by Iteonoer [19) from Pon- 
tryagin's maximum principle. 

Let us conclude this example by making some observations 
.„„t. tl«> nnnconatant case. First, the expression for the 


dx u >fU - -/us, + (l/r u )»i I 


Let T be the matrix of coefficients of these equations. 

To compute the transition matrix ©(£, <a) corresponding to T, a jj 0ut nonconstant case. First, the expresaroi 

we note first that the eigenvalues of T are ±Ju- Using this fact derivative of the Lyapunov function given by l! 57) remains 
ind constancy, it follows that " * '* " * 


., 8(1, fe) - exp T(< - fc) - G exp (t - U)J„ 

+ C,exp [-(£ 


where the constant matrices C, and C» 
the requirements 

€Kfe, W = Cj + C, - 1 - unit matrix 

tie«, r.)/*U % = ma, t,)U % - J,,c, - j»Ct 

After * good deal of algebra, we obtain 


without any modification. Second, assume *>«(*»> has been evalu- 
ated somehow. Given this number, PnW can be evaluated for 
t & h by means of the variance equation (51 ); the existence of a 
Lyapunov function and in particular (57) shows that this compu- 
miqueiy determined by tation is stable, i.e., not adversely affected by roundoff errors. 
1 y * Third, knowing p n (t), equation (57) provides a clear picture of 

the transient behavior of the optimal filter, even though it might 
be impossible to solve (51) in closed form. 
Example S, The variance equation is 


ooA/nr 


Knowledge of #(.', k} can be used to derive explicit solutions 
to a variety of ooturtaUonary filtering problems, 

Wt> consider only one such problem, which was treated by Shin- 
brot (3, Example 2). He assumes that/u < 0 and that the mes- 
sage process has reached steady-state. From (39) we see that 

SxAO - -W3fu for all t 
Wl r.-wome that the observations of the signal start at i = 0. 
Sine* '*he eaymatea must be unbiased, it is clear that A(0) - 0. 
"therefore 

) Mm - - z*m - -wsfc 

^instituting this into (55), we get Shinbrot's formula: 

. m _ „ r (/ " - J-* 7 "' ^ (/.. + 7»>- ?l " I 

p,,<0 ~ 9u L-(/» - hM>< + (/., + /»}»«-*" J 

Since /„ < 0, we see that as I — ► <», j» u (f} converges to 

pu - -?«/(/,» + J..) - (/u - 
hmt\ a! Basis Engineering 


rfpu/rff - 2/npu - pu'O/n. + Vr») + ?.« 


If ?» > 0, f,t > 0, and r» > 0, the conditions of Theorems 3-4 
are satisfied. Therefore the minimum error variance in the 
steady-state is 


fa_ + \ / 7» i + luf ra + On/fa 
P " = " i/m + l/r„ 

and the optimal steady-state gains are 

• £>< - J>uA«, * » ». 2 
The same problem has been considered also by Westeott [30, 
Example!. A glance at his calculations shows that ours is the 
simpler and more natural approach. 
Example S. The variance equation is 

dpu/di = -pn*/rn + 9.1 1 

dpn/dt - p., - Pts -Piipn/r.i | (58) 

rfp«/(i« - 2<p« - pk) - PttVni J 
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If qn > 0, r„ > 0, the conditions of Theorems 3- i arc satisfied. 
Setting 0 |»u»l to «w, w« get the solu- 

tion of il rohlen 

&■ => V g»/fu 

tn - i Vi :\ 1; 


Is oi i < 

0 0 1/rn 


See Fig- 3(6). 
The infinitesimal ti 
, 1 into i? 


ratio is (V«)I2 + (r,,/. =1 
problem, the f >ar« meter.* of (he 
by inspection. 

The solution of the dual pro!: 


x os (.he optimal filter jr 


g (29), we find ((. - 0): 


• - . " L* 


v Kiiiiisiitk 1 24], using the Elder 


find, f. 


p,, (/, f„) V 7 Vu t Ifc/nr -„ 


dpu/cfc - 2/,!pi5 - pn'/ru -f in 
#is/rfi - tei + /w?»m - p. W« ) (59) 
<W<" = 2/«p„ - p„Vr„ 
If/,, 5-* 0,/,, t< 0, mid r, f > <>, i'he rendition* of Theorems 3-4 
ar* Kittened. 1 hen: are then two seta of possibilities for the right- ^-(j, T ) - 
bend side of fall) to vanish for rromiewsiivo p 2; : 

(A) />« = VWi (B) fm = Vl',0 t l/, 8 /V,»u>>.i 

p, 3 - (1 pis - 2/sir.i 

Pn - -</»//>*) VW.' 
The expression for p M shows that Case (A) applies when f tt fn is 
negative (the model is stable but not asymptotically stable) 

„1 I ft « i the model i« -1 I 1« 

The optimal filler is shown in Fiji. .,{?,). The optimal gains arc 
given by 

fit = Pu/r,„ fet - Pn/ru 
If/,* ft 0 but /si - 0, the model is completely olwprvid.!- but 

not eompletcly controllable. Hence the steady-state v-erifmee- 
ist • < r ' > i 

since Theorem 4 is not applicable. As a matter of fact, the 

pti Ifilier ill th I 1 ! !' 1\ ' I Of ;m\ it in not 

symptoticitUy st«b!e. 


Tins formula, obtained here with litt 
results of Shinbort (3, Example 1 J. 

e o filter U shown i 1 1 ; 

gains tend to 0»!-» «; in other v 
less attention to the iiieominji FiRiiab 
the previous estimates of xt end 

Since the conditions of Theorem - 
suspect that the optimal filter b h 
ponentially 121]) asymptotically sti 


■ ealeuS 


2 0, 


Since i/.'nfr 
, le; i that 1 
totscaliy sta 


If/,, 


if /., 


Thus if /„ - 0, the -„..!■-.... 
F.r'mph 5. The variance e 
dpn/df «= 2 

<Wdt = p 
«*M = - 
Wc assume that n, > 6; thi 
Me. Wc thcu fisnj that the f 
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q<lilih 


pOSlt ivi 


' ; if r , 
definite 


output 

4>-M, r)k„{r) -1 W, * • r t f ,,, 

This agrees with Shinhrot's result 

Example. 0. The variance equation is: 

#i./<ft = 2ptt - Wit ~ firu'pt-'.-r-j j 

<W<& = P» - /.,•>.,».- ' . - >■ */'••)'-« ' 

- -AoWMi - *it*F*V r » + *' 1 

If ft,, yt 0. g„ > 0, • ( ». • - - 0 (f '-" ' '»" ■ 

Theorems 3-4 are sate.fi i 1 u 11 ' " 

„. ,1 o, , ,„ \,--,<U b> r v. iv „.'i fed -dp bi lie problem. We 
itiirodiiee first the abbreviations' 


:,~ ileii. Theorem :t is applica- 


a = (An 


Ttansactions of the ASMt 


- Pa = 


vV«";»-> 


Itise-a ; overi 5 tl d I., , >« 1 .- i side ot (60) vanishes for 
Shis set of Pi/e; by Theorem 5, this cannot happen for any other 
mt. Hence the solution of the stationary Wiener problem is com- 
plete. It is interesting to cote that the conventional procedure 
would require here the spectral factorization of ft two-by-two 
matrix which is «;ry much more difficult algebraically than by 
the present method. 

i! !*i transit theoptin 1 filter is given 


-jjt J 


« + a + |S» 

! he natural frequency of the optima! Sitter is 

« - lMf»* t )\ - "v/oT 

and the damping ratio is 

f-|ReX( Fw .,!/«--^^ + ^ 

wt seta » egarded a t'igi . > in ise ra - 

• . 1 nuaeters of tl \ r depend mij « n these 

ratios, there is a possibility of building an adaptive filter once 
1 ' f I ■ ' 1 - > 1 aikble. An in- 

1" tl bvBuev! 11 

(fed tt* when A,, -0-0. 

12 Problems Related to Adaptive Systems 

f i« lit >uid be of considwabi tsi 

' it Li ! ,1 

,>;<.• 1 1 . 1 . 

An adaptive system is one which changes its parameters in ac- 
es in 1 environment. In the 
, t is rcfteo 

time-dependence of F, G, H, Q, R. Our theory shows that such 

t ' . 1 1 i j ters but not ti 

structure of the o 1 ia is what on 

1 1 > . ilt-o h rigorous proof. Under ideal 

'be cl s ironment could be detect 

msUuitawwirfy ami exactly. The adaptive fiber would then 
s (I-IV). In 


(1) The theoretical aspect. Here interest centers on: 

(1) The general form of the solution (see Fig, 1). 

(ii) Conditions which guarantee a priori the existence, physical 
readability, and stability of the optimal filter. 

iii Chat terizatioi f the general results in terms of some 
simple quantities, such as dignal-to-noise ratio, information rate, 
bandwidth, etc. 

An important consequence of the lime-domain approach is that 
thi f mi r t m 1 npletely divorce ' e a» 

sumption of stationarity which has dominated much of the think- 
ing in the past, 

(2) The computational aspect. The classical (more accurately, 
old-fashioned) view is that a mathematical problem is solved if 
the solution is expressed by a formula. It is not a trivial matter, 
however, to substitute numbers in a formula. The current litera- 
ture on the Wiener problem is full of semirigorously derived 
formulas which turn out to be unusable for practical computa- 
tion when the order of the system becomes even moderately large. 
The variance equation of our approach provides a practically 
< eful and theon eailj 'clean" technique of numerical computa- 
tion, s ft guarant > ergenc thes quatioi 

e com 1 i 1 , 11 sidered solved, except for 
purel) numerical difficulties. 

Some open problems, which we intend to treat in the near 
future, ares 

(i) 1 ^tcnbion of the theory to include nonwhite noise. As 
mentioned in Section 2, this problem is already solved in the dis- 
ie case (HI, and the only remaining difficulty is to get a 
tt canonical form in the continuous-time case, 
(ii) General study of the variance equations using Lyapunov 

(ill) Relations with the calculus of variations and information 
theory, 
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